AI032
Programming Massively Parallel Processors: A Hands-on Approach
Advanced CUDA Threading and Scheduling
Learning Objectives
- Analyze the GigaThread engine's role in global block distribution across Streaming Multiprocessors.
- Evaluate the impact of warp scheduling and instruction dispatch on pipeline utilization.
- Optimize kernel performance by balancing register pressure and shared memory against occupancy.
- Master advanced synchronization primitives and cooperative group execution patterns.